1 Are Efficient Natural Language Parsers Robust ?
نویسندگان
چکیده
This paper discusses the robustness of four efficient syntactic error-correcting parsing algorithms that are based on chart parsing with a context-free grammar. In this context, by robust we mean able to correct detectable syntactic errors. We implemented four versions of a bottom-up error-correcting chart parser: a basic bottom-up chart parser, and chart parsers employing selectivity, top-down filtering, and a combination of selectivity and a top-down filtering. The combined selectivity and top-down filtering parser was the most efficient. However, this parser failed to correctly repair more sentences than the other parsers, failing on 18 out of 119 ill-formed sentences, compared to no failures for the basic bottom-up chart parser. This paper examines trade-offs between parsing efficiency and robustness at the syntactic level.
منابع مشابه
A General Method for Transforming Standard Parsers into Error-Repair Parsers
A desirable property for any system dealing with unrestricted natural language text is robustness, the ability to analyze any input regardless of its grammaticality. In this paper we present a novel, general transformation technique to automatically obtain robust, error-repair parsers from standard non-robust parsers. The resulting error-repair parsing schema is guaranteed to be correct when ou...
متن کاملComparative Study of GLR Parser with Finite-state Predictors and Chart-based Semantic Parsers
The natural language processing component of a speech understanding system is commonly a robust, semantic parser, implemented as either a chart-based transition network, or as a generalized left right (GLR) parser. In contrast, we are developing a robust, semantic parser that is a single, predictive finite-state machine. Our approach is motivated by our belief that such a finite-state parser ca...
متن کاملError-repair parsing schemata
Robustness, the ability to analyze any input regardless of its grammaticality, is a desirable property for any system dealing with unrestricted natural language text. Error-repair parsing approaches achieve robustness by considering ungrammatical sentences as corrupted versions of valid sentences. In this article we present a deductive formalism, based on Sikkel’s parsing schemata, that can be ...
متن کاملMulti-Strategy Construction-Specific Parsing for Flexible Data Base Query and Update
The advantages of a multi-strategy, construction-specific approach to parsing in applied natural language processing are explained through an examination of two pilot parsers we have constructed. Our approach exploits domain semantics and prior knowledge of expected constructions, using multiple parsing strategies each optimized to recognize different construction types. It is shown that a mult...
متن کاملA Finite-State Parser with Dependency Structure Output
Dependency parsers and nite-state parsers are both capable of rapid and robust parsing of natural language. Dependency parsers produce richer output structures, while nite-state parsers can be more eecient. We show how a nite-state parser can be used to produce dependency structures for most phrase types, with an O(n 2) complexity in the number of words. The parser allows syntactically ambiguou...
متن کامل